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Abstract. A model-independent global search for new physics has been performed at the CDF experiment. This search 
examines 399 final states, looking for discrepancies between observation and the standard model expectation in populations, 
kinematic shapes, bumps in mass distributions suggestive of new resonances and the tails of the summed transverse momentum 
distribution. This global search reveals no evidence of physics beyond the Standard Model in 2.0 fb^' of pp collisions at 
yG= 1.96 TeV. 
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INTRODUCTION AND STRATEGY 

A model-independent global search for new physics has 
the unique ability to complement conventional dedicated 
searches for physics beyond the Standard Model. Due 
to the current proliferation of extensions to the Standard 
Model, it is extremely difficult to decide among the large 
number of candidate models which one is the most fa- 
vored. Even within the Minimal Supersymmetric Stan- 
dard Model, which is arguably the most favored model in 
the particle physics field, there still remains the problem 
of covering the entirety of the 107 dimensional parameter 
space in any search meant to decide for certain whether 
supersymmetry is a true symmetry of nature. Thus, this 
global, model-independent search attempts to cover the 
large remaining regions of signature space, to ensure that 
no possible hint of new physics is missed. 

The overall strategy of this global search is to maxi- 
mize the scope of data that is analyzed, use our exper- 
tise of the Standard Model to implement a global Stan- 
dard Model prediction, and then to focus attention and 
scrutiny on genuine statistically significant outlying ef- 
fects. To achieve these goals, a predominantly Monte 
Carlo based Standard Model prediction is generated. 
Rather than adopting predefined signal and control re- 
gions, the entirety of the data are assumed to potentially 
contain the first signs of new physics. At the same time, 
the entirety of the data contribute information for con- 
straining the small number of physically well motivated 
correction factors, which modify the Standard Model 
prediction. These adjustments are designed to give an 
accurate accounting of the degree to which we under- 
stand systematic effects. Those discrepancies which per- 
sist after these adjustments are considered as possible 
candidates for evidence of new physics, and are subse- 
quently further investigated and scrutinized. Those can- 
didates for which one has reasonably ruled out an expla- 
nation in terms of a statistical fluctuation, a detector ef- 



fect, or background mismodelling, will provide the basis 
of claims of new physics. Further details of this global 
search may be obtained in a previous publication based 
on 927 pb-' ofdata|[ll]. 

VISTA 

This global analysis begins by identifying the objects 
of interest. Electrons, muons, photons, taus, jets, jets 
tagged as originating from a bottom quark, and missing 
transverse momentum f^r with sufficiently large trans- 
verse momentum (pr > 17 GeV) are identified and re- 
constructed using standard CDF criteria. A set of global 
event selection criteria is imposed in order to retain a 
wide spectrum of events at the highest transverse mo- 
menta for analysis. The events are partitioned into exclu- 
sive final states based on the objects that they contain. A 
large set of observables relevant to each final state is con- 
structed from kinematic variables. They include object 
transverse momenta, polar and azimuthal angles, angles 
between pairs of objects, masses of all combinations of 
objects, and a number of more specialized observables. 

To construct the Standard Model prediction, a diverse 
set of Monte Carlo generators are used to generate events 
to cover the entire spectrum of particle production at 
the Tevatron. They include dijet production, photon-njets 
production, diphoton production, production of a vector 
boson in association with jets, production of a pair of 
vector bosons, and top quark pair production. Extremely 
careful accounting of the multitude of backgrounds must 
be made in order to ensure that no backgrounds are 
missed and none are double counted. These background 
events are subsequently reweighted according to a set 
of coiTection factors fit from data to take into account 
systematic effects. They include the integrated luminos- 
ity, the k-factors of various Standard Model processes, 
trigger and reconstruction efficiencies, and a number of 
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FIGURE 1. Summary of the results of the global comparison 
of data to SM prediction for populations of final states (a) and 
shapes of kinematic distributions (b). The black line represents 
the theoretical expectation assuming no beyond-SM physics. 
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FIGURE 2. Significance of the most interesting bump in 
each mass variable. Each entry corresponds to one mass distri- 
bution found to contain at least one bump satisfying the quality 
criteria. 
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FIGURE 3. The three most interesting final states found by 
Sleuth in 2.0 fb" ' . The label in the top left corner of each plot 
shows the identified objects present in each final state, where 
is a lepton (e or /x), / ' is an additional lepton of different flavor, j 
denotes a jet and represents missing transverse momentum. 
The region identified as having the most significant excess of 
data over background is indicated by the blue line and displayed 
in the inset. The significance of the excess is shown by the value 
of ,0^ in the top right comer. 



fake rates [1]. The values of these factors are obtained 
by maximizing the global agreement with data in the 
populations of the exclusive final states, subject to exter- 
nal constraints from theory calculations and other exper- 
imental results. The degree to which any particular dis- 
crepancy may be resolved by an adjustment of the correc- 
tion factors simply reflects the degree to which system- 
atic effects are not well understood enough to constrain 
those factors. 

Having implemented the entire Standard Model pre- 
diction, it is possible to make a global comparison be- 
tween the data and the prediction. Using Poisson statis- 
tics and the Kolmogorov-Smirnov test, the Vista global 
comparison quantifies the degree to which the popula- 
tions of the exclusive final states and the shapes of the 
kinematic observables agree with the prediction. The re- 
sults are shown in Figure[T] After accounting for the trials 
factor associated with searching in multiple final states, 
we find no statistically significant discrepancy from the 
Standard Model prediction in the populations. Of the 
19650 kinematic observables that are investigated, 555 
actually do exhibit a statistically significant discrepancy. 
However, further investigation and analysis of those ef- 
fects reveal that they likely arise from a difficulty in mod- 
elling soft QCD jet emission in the underlying Monte 
Carlo events. In particular, they are attributed to the fact 
that the Pythia showering parameters are not particularly 
well tuned to the Tevatron data ^ . 

BUMP HUNTER 

Having gained confidence in the Standard Model back- 
ground prediction from the results of the Vista global 
comparison, we are able to use more specialized statisti- 
cal tools to look for new physics. An algorithm called the 
Bump Hunter searches in a total of 5036 mass variables 
to find local excesses of data over background suggestive 
of resonant production of new particles f?]. All mass dis- 
tributions are scanned in windows whose sizes are con- 
sistent with the detector resolution, implicitly assuming 
that the intrinsic width of the new particle is narrower by 
comparison, and the window with the most significant 
excess of data over background is selected. Certain qual- 
ity criteria are imposed to eliminate pathological cases. 
The results are summarized in Figure |2] The shift be- 
tween the expectation from pseudo-experiments and the 
observed data is due to the fact that this algorithm is sen- 
sitive to local mismodelling in the background, illustrat- 
ing the well known fact that the Monte Carlo events do 
not perfectly reproduce the data. There is one genuinely 
outlying excess, which warranted further scrutiny. This 
bump has a significance of 4.1(7 after accounting for the 
trials factor, and occurs in the mass of all jets in the 4-jet 
final state. The region that is selected coincides with the 



region in which the well established soft jet emission dis- 
crepancy lies. As a result, one concludes that the effect is 
likely due to the same underlying QCD parton showering 
issue and is unhkely to be new physics. 

SLEUTH 

Finally, an algorithm called SLEUTH attempts to find 
statistically significant excesses in the tail of the 
distribution. Each semi-infinite region beginning with 
each data point and extending to infinity are considered 
and the one exhibiting the most significant excess of data 
over background is selected in each final state. 

The three most significant SLEUTH final states are 
shown in Figure [3] The significance of the top SLEUTH 
discrepancy is 0.00055. After accounting for the trials 
factor, this significance translates to a final probability of 
8% that hypothetically similar CDF experiments would 
yield a final state that is as or more interesting than the 
most interesting final state found. 

CONCLUSIONS 

CDF has completed a model-independent global search 
for new physics in 2.0 fb^' of data. The populations of 
the 399 final states exhibit no statistically significant dis- 
crepancies relative to our Standard Model implementa- 
tion. Of the 19650 kinematic observables that are ex- 
amined, 555 show clearly significant deviations from the 
prediction. Most of these are found to be due to difficul- 
ties in modelling soft QCD jet emission and in tuning 
the Pythia showering parameters to the Tevatron data, 
which also explains the only significant outlier in the 
search for mass resonances in 5036 mass distributions. 
The small number of remaining shape discrepancies is 
understood to be due to the overall crudeness of the cor- 
rection model. Finally, the SLEUTH statistic, while point- 
ing out the interesting signature of same sign, opposite 
flavour dilepton events in the top three final states, re- 
vealed no statistically significant excess at high 
Unfortunately, this analysis is unable to make any dis- 
covery claims of new physics. 
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